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ABSTFLACT 



J. I 



This thesis compares three types of models developed to predict overhead costs 
for seven government aerospace contractors. The methodologies utilized to develop 
the models include generalized least squares, univariate Bo.x-Jenhins, and multivariate 
Box-Jenkins procedures. The results of those models are compared using three 
measures of effectiveness: correlation coefficient between actual and predicted values, 
root mean squared error divided by the mean of the actuals, and mean absolute 
percentage error (in percent). As was expected, the univariate Box-Jenkins method 
produced short term forecasts which were superior to those of the least squares 
regression models. However, the regression forecasts were highly accurate and were 
considerably less expensive to obtain. Only one multivariate Box-Jenkins model could 
be developed. The results of this model were marginally superior to the related 
regression model and significantly inferior to the univariate Box-Jenkins model for the 
same contractor. 
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I. INTRODUCTION 



Overhead costs often constitute a large portion of total product costs in many 
industries. In the case of government aerospace contractors, overhead costs may 
comprise as much as fifty percent of the contract cost negotiated by the Federal 
government. Historically, overhead costs have been predicted by applying estimated 
overhead rates to estimated labor hours in several categories of operations. These 
category totals are summed to arrive at a total overhead cost for a particular product. 
This procedure of estimating overhead is greatly affected by changes in the level of 
operations. Fluctuating production rates often lead to overhead rates which display a 
lagged relationship between the applied rates and actual overhead costs. Consequently, 
this particular method of predicting overhead costs produces estimates which are 
inadequate. 

Alternative methods for predicting overhead have been proposed. In many cases, 
these methods attempt to establish a direct relationship between costs and factors 
related to direct production. Thus the need for reliance on estimated overhead rates is 
eliminated. Various least squares regression models have been proposed in attempts to 
establish the desired direct relationship. One such model was developed with simplicity 
in application as a strong consideration [Ref T. p.7]. This allowed users of varying 
degrees of statistical familiarity to apply the model in actual work conditions. This 
model intentionally utilized the minimum number of explanatory' variables necessary' to 
achieve accurate results. Several applicable independent variables were considered and 
observations of direct personnel demonstrated the strongest relationship with overhead 
costs [Ref 1: p. 20]. As with any economic trend data, autocorrelation must be 
suspected. The use of quarterly observations in the model required testing for first 
order AR(1) and fourth order AR(4) autoregressive processes. Appropriate tests and 
corrections were incorporated into the model to ensure the absence of bias in the 
standard errors of the coefficients and in the R-squared statistic [Ref 2: p.283]. The 
results of this model are included for comparison purposes. 

A second alternative to the use of estimated overhead rates is the Box-Jenkins 
method of forecasting. This method, which is designed to produce highly accurate 
short term forecasts, allows for a wide range of possible models to apply to a particular 
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economic series [Ref. 3; p. 11]. The degree of statistical sophistication required to apply 
this method is far greater than that required for the regression model mentioned above. 
In addition, the use of the Box-Jenkins forecasting method requires far greater 
computer resources during the three stages of identification, estimation, and forecasting 
than most regression models. Consequently, the Box-Jenkins method has not been 
applied as a forecasting tool in many cases in which it would be a logical alternative. 

The Bo.x-Jcnkins transfer function (multivariate Box-Jenkins method) utilizes 
deviations from appropriate means of an input (X) and of an output (Y) to establish a 
relationship from which forecasts can be made. The connecting tool between these two 
series is a linear differential equation. This extremely complicated forecasting tool 
requires extensive expertise on the part of the statistician. Once again, this highly 
effective forecasting tool has been underutilized. 

This paper attempts to develop usable least squares regression, Box-Jenkins, and 
Box-Jenkins transfer function forecasting models for seven government aerospace 
contractors. The effectiveness of each model is measured by withholding the last four 
data observations during the model development phase and using the model to forecast 
the withheld observations. The deviation of the actuals from the predicted values have 
been indicated with three measures of effectiveness: correlation coefficient between 
actual and predicted values, root mean squared error divided by the mean of the 
actuals, and mean absolute percentage error (in percent.) The development and results 
of each model are included for comparison purposes. 
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II. DATA 



The data have been supplied by seven government aerospace contractors. To 
maintain confidentiality, all references to specific contractors will be with the labels A 
through G. A specific reporting format was utilized by each contractor during data 
collection. Quarterly data spanning the period beginning in the first quarter of 1979 
and continuing through the third quarter of 1986 was requested from each of the seven 
contractors. Usable data were obtained from each contractor for large portions of the 
requested period. However, only one contractor was able to supply all thirty-one 
observations. 

In the reporting format, overhead costs were composed of costs from three major 
categories which had been further refined into sL\ subcategories. The first major 
category, labor related costs, was composed of two subcategories: indirect salaries and 
fringe benefits. The second major category was facility costs. The third major 
category is composed of three subcategories: electronic data processing costs, 

independent research and development and bid and proposal costs, and all other 
overhead costs. 

.All components of overhead costs were converted to constant 1984 fourth 
quarter dollars. The labor related cost categories adjustment was accomplished 
through the application of the Bureau of Labor Statistics SIC 372 price inde.x for 
production worker average hourly wages for the aircraft and parts industiw. In the 
case of this inde.\, monthly indices were averaged to produce quarterly indices. Gross 
National Product Deflater indices published by the Bureau of Economic Analysis were 
applied to the two remaining major categories. Facility costs were adjusted with the 
GNPD gross private domestic fixed nonresidential investment index. The GNPD 
personal consumption services index was utilized to adjust the final major category. .As 
with all indices, those used were imperfect. They were chosen because they provide the 
best adjustments for inflation among all readily available and relevant indices. 

Data pertaining to direct production were obtained from each contractor. The 
only direct production data set utilized in the analysis was direct labor personnel. This 
category of data did not require adjustment for inflation. 
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III. MODEL DEV ELOPMENT 



A, THE REGRESSION MODEL 

The model utilized in this project was designed at the request of the Naval Air 
Systems Command as part of its contractor overhead tracking project. The model 
capitalized on the explanatory and prediction properties of least squares theory. 
Individual regression models were developed for each contractor in an effort to 
accurately predict future overhead costs. Simplicity in application has been a key 
factor in the model's development. 

The application of least squares to economic trend data almost inamediately 
implies autocorrelation in the error terms of the regression. With the presence of 
autocorrelation, the estimates of the coefficients are unbiased and consistent. 
However, they are not effiicient. The estimate for the variance of the coefficients are 
biased. Positively autocorrelated errors produce a coefficient variance which is 
underestimated because of a downward bias in the estimate of the variance. The 
downward bias will produce a confidence interval which is narrower than it should be 
for each coefficient. For this reason, tests of the null hypothesis that the coefficient is 
equal to 0 will be rejected in instances in which it should be accepted. Likewise, 
autocorrelated errors will cause exaggerated R and F statistics when an ordinary least 
squares model is applied. 

The effects of autocorrelation can be eliminated through the application of 
generalized least squares (GLS) procedures. The application of GLS to autocorrelated 
data produces estimators of the coefficients which possess the properties of ma.ximum 
likelihood estimators. Therefore, the GLS estimators of the regression coefficients and 
the variances of these coefficients will be unbiased, consistent, and asymptotically 
efficient. This will lead to more reliable estimates of and F. [Ref 4: pp. 302-31 1] 

This project examined quarterly data. For this reason first order AR(T) and 
fourth order AR(4) autoregressive processes are suspected. AR(1) processes are 
detectable with a Durbin \V'’atson Test. In those cases where AR(1) is present, the data 
has been transformed in the following manner; 

(eqn3.l) 
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(t= 2,3,4 T). - 



where 



where 



T 

^e e 1 
t=2 

= (t=2,3 T) (eqn3.2) 

T-1 
Ve 2 

t=2 

e^ = residuals from the OLS regression. 



In equation 3.2, Pj defined in this manner is the two stage Prais-Winsten 
estimator derived by Park and Vlitchell [Ref 5]. The AR(4) process detected in the 
model, a special form of the general AR(4) process, is written as 






(eqn 3.3) 



The general form for AR(4) processes is 

^t P l^t- 1 P2^t-2 ^3^t-3 P4^t-4 



(eqn 3.4) 



In both cases 

Uj ~ N(0 ,ct2). 

In this model, the effects of the three prior quarters are assumed to be negligible 
while the effect of the quarter one year previous is considered significant. This special 
form of AR(4) process is detectable with a Wallis Test [Ref 6]. If AR(4) processes 
were detected, the data sets were transformed in the following manner 



=Zj(l - p4“)‘'‘- 

(t= 1.2, 3,4), 
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(eqn 3.6) 



Zj Zj - pqZj. j 

(t = 5.6.7 T).- 



where 



P4 = 



T-(l - .Sdj) ^ 



T- 




(eqn 3.7) 



In equation 3.7. T is equal to the number of observations. K is the number of 
parameters to be estimated, and d^ is the Wallis Test Statistic written as 



where 



T 

I (et - e,.4)2 

t = 5 

dq = (t=l,2 T) (eqn 3.8) 

T 

t= 1 

Cj, = residuals from the OLS regression model. 



This particular estimator, derived by Theil and N'agar [Ref. 7; p. 287], was found 
to be the most efficient among nine alternative estimaters applied directly to these data 
sets. [Ref. 8: p. 49] 

After each OLS model was computed, the Wallis and the Durbin Watson 
Statistics were examined for the presence of .AR{4) or AR(1) processes. Appropriate 
transformations were made and the models were reestimated. At the conclusion of the 
transformation and reestimation phase, all traces of autocorrelation had been removed 
.Tom the GLS models. In each case, the residuals were examined and all tests for 
normality were accepted. 

The following procedure was applied to each of the seven contractors. A detailed 
presentation of the results will be made for contractor A. The results from the 
remaining contractors will be made in a summarized fashion with appropriate 
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comments. Direct comparison of the regression models between two contractors does 
not indicate relative efficiency. .Aside from the organizational differences within each 
firm, each of these contractors specializes in a particular branch of the aerospace 
industry. E.xamples include Rxed wing aircraft and aircraft engine producers. For 
these reasons, comparison of the models between contractors is inappropriate. 

Contractor .A supplied data spanning twenty-six quarters. Therefore, all three 
models; regression. Box-Jenkins. and Box-Jenkins transfer function, are based on 
twenty-two observations. The remaining four observations are withheld during the 
model development stage and are used for comparison purposes with predicted values 
during the forecasting stage. The results of the OLS and GLS models are presented in 
Table 1. The adjusted R“ value of .1S2S and the F value of 5.93 indicate that the OLS 
model is poor. The Wallis Statistic of .5S01 indicates the presense of the special form 
of AR(4) suspected in the data. The appropriate transformation to correct for AR(4) 
was made and the model improved significantly: adjusted R^ = .8718 and F= 150.6569. 
.At this point, the residuals indicated the effects of first order autocorrelation. .A 



second transformation was made and the model improved slightly: adjusted R“ = .8787 
and F= 160.3358. Both forms of autocorrelation had been completely removed at the 
completion of the second transformation. 



TABLE 1 

REGRESSION MODELS FOR CONTR.ACTOR A 



OLS GLSAR(4) GLSAR(l) 



.Adjusted R“ 
F Statistic 
Intercept; 



54778.277 

68215.152 



.183 

5.921 



150.657 

11487.623 

5924.463 



.872 



160.336 

3047.305 

2438.277 



.879 



Standard Error; 
Slope: 



13.960 

5.737 

1.618 

.580 

.187 

.723 



15.701 

1.279 

.644 



15.946 

1.259 

1.991 

1.432 

.080 

.294 



Standard Error: 
Durbin Watson Statistic: 
Wallis Statistic: 

Estimate of p j 
Estimate of p_j 



1.351 



.694 

.334 
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The regression models for the remaining six contractors are depicted in Table 2. 
A single transformation was required to eliminate dll indications of autocorrelation in 
the cases of contractors B and C. The required transformation removed the effects of 
an AR(4) process. The data supplied by Contractor E was free of the effects of 
autocorrelation and the model listed in Table 2 is the OLS model developed for this 
contractor. The remaining contractor data sets were transformed twice to achieve the 
desired residual characteristics. All of the models derived during this stage possess 
good forecasting capabilities when utilized to predict the four data points which were 
withheld. This will be discussed in the next chapter. 

B. BOX- JENKINS METHOD 

The Bo.x-Jenkins method of forecasting is comprised of three stages; 
identification, estimation, and forecasting [Ref 9: p. 19). In the identification stage, the 
series is often differenced to achieve stationarity about a mean (usually 0). During the 
development of these models, differencing of the order of one period (regular 
differencing) or four periods (seasonal differencing) was considered. As is standard 
with Box-Jenkins methodology, no more than two differencing corrections were 
required for any model [Ref 10: p. 125]. With one exception, a stationary' series was 
achieved through the application of regular, seasonal, or a combination of both types 
of differencing for each of the contractors. Contractor D data were already stationary 
and did not require differencing. The autocorrelation (ACF) and partial 
autocorrelation(PACF) functions of this stationary series are analyzed to determine the 
horizontal subpatterns within the series. The stationary trend can be specified as a 
linear combination of past series values (autoregressive terms), a linear combination of 
past random errors (moving average terms), or a combination of both. The 
determination of the appropriate number and specific lag of the autoregressive and 
moving average terms is made during the analysis of the ACF and PACF. Spikes on 
the .ACF accompanied by trends on the P.ACF resembling exponential decay for the 
same lag indicate the appropriateness of a moving average parameter at that lag. 
Likewise, spikes on the PACF accompanied by an exponentially decaying trend on the 
ACF signal the need to include an autoregressive term at that lag. Spikes on the .ACF 
and P.ACF are taken to be correlation values for a given lag which are statistically 
different from 0. Once the character of the trend is identified, the estimation phase is 
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1 

TABLE 2 




SUMMARY OF CONTIUACTOR REGRESSION MODELS 


REGRESSION MODEL for CONTRACTOR B 


.Model; OVRHD(B) = a+ b 


PERSONNEL(B) 


Adjusted R“ 


.927 


F Statistic 


243.SS1 


Intercept; 


11978.143 


Standard Error; 


5952.942 


Slope; 


15.215 


Standard Error; 


.974 


REGRESSION MODEL for CONTIUACTOR C 


Model; OVRHD(C) = a+ b 


PERSONNEL(C) 


Adjusted R^ 


.840 


F Statistic 


106.326 


Intercept; 


327.038 


Standard Error; 


3911.146 


1 Slope; 


8.702 


1 Standard Error; 


.844 


REGRESSION MODEL for CONTRACTOR D 


.Model; OVRHD(D) = a+ b 


PERSONNEL(D) 


.Adjusted R" 


.726 


F Statistic 


64.448 


Intercept; 


13864.996 


! Standard Error; 


3084.697 


Slope; 


7.999 


, Standard Error; 

1 


.996 



entered to determine the parameter values. A first order autoregressive series is 
expressed as follows 



Zt = 0Q + (pZ^.j + Aj, 



(eqn 3.9) 
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TABLE 2 

SUMMARY OF COXTRiXCTOR REGRESSION MODELS (CONT'D.) 



REGRESSION MODEL for CONTIU^XCTOR E 
Model; OVRHD(E) = a+ b PERSONNEL(E) 



Adjusted R“ 


.527 


F Statistic 


26.590 


Intercept; 


4976.203 


Standard Error: 


1S56S.690 


Slope: 


14.949 


Standard Error: 


2.899 


REGRESSION MODEL for CONTR.'\CTOR F 


Model; 


OVRHD(F) = a+ b PERSONNEL(F) 


Adjusted R^ 


.707 


F Statistic 


63.739 


Intercept; 


884.075 


Standard Error; 


1083.769 


Slope: 


13.012 


Standard Error; 


1.629 



REGRESSION MODEL for CONTR.^CTOR G 
Model: OVRHD(G) = a+ b PERSONNEL(G) 



Adjusted R“ .857 

F Statistic 133.155 

Intercept: 4716.544 

Standard Error: 3020.132 

Slope; 20.545 

Standard Error: 1.780 



where 

0Q= series mean 

(Pj = weighting of the previous period value 
Aj= white noise ~N(0,<r^"). 
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An alternative form of the model in terms of deviation from the series mean is 



where 




(pZ^.l+ 




- Z^-0Q. 



(eqn 3.10) 



This e.xpression can be expanded to include any number of past series values and is 
written as follows: 



Zt = (piZj.j ' +<p 2 Zj .2 +•••■*• (ppZj.p +Aj. (eqn3.ll) 

The backward shift operator, B. is often utilized to express the relationship of the 
present term to the relevant past terms. The operator is a symbolic indicator and does 
not imply multiplication of Z^ by a constant B. Its use indicates a desire to express 
past consecutive terms of a series as an ordered group. The operator possesses the 
following relationship: 



BZ, 



-t-1 



B^Zj — Zj_') 

B’^Z = Z 



Using the operator, the autoregressive process of order p is expressed as 

<p(B)Z^ = Aj (eqn 3.13) 

with 

(p(B) = (1 -<PjB-<P 2B2-...- (ppBP). 

The moving average model assumes that the series can be expressed as a 
weighted average of past successive white noise terms. A first order moving average 
model expressing the series as deviation from the mean of the series is: 

Zj = (eqn 3.14) 

where 

= series is white noise ~ N(0,<r j^^) 

0j = coefficient of the most recent white noise term. 



18 



This basic relationship can be expanded to include any number of past terms. .A 
moving average process of order q is written as: 

"" '"'"t*^K^t-r® 2 ^t- 2 ‘--'®q‘‘^t-q- (eqn 3.15) 

Using the backward shift operator this model is expressed as 

Zj* = 6 (B)A^ (eqn 3.16) 

with 

0(B) = i-eiB-e2B2-....e^B^. 

.A combination of these types of models can be developed and possesses 
relationships to past series terms and to past white noise elements. The process is 
called a mixed autoregressive-moving average model and is expressed as; 

Zt" = ^Pl^t-l' +- + <PpZt.p“ + A^-0lA^.l-...-0q.A^.q. (eqn 3.17) 

This expression is usually written in terms of the backward shift operator, 

(p(B)Zj'' = 0(B)A^. (eqn 3.18) 

Models which adequately describe the data rarely exhibit values of or <7 greater 
than two and usually are less than two. [Ref. 10: p. 66 ] 

The use of differencing to achieve a stationary' series permits the use of the Bo.x- 
Jenkins method to model series which are nonstationary in nature. The backward 
differencing operator V is used to indicate the following relationship; 

VZj = Z^-Zj., = (l-B)Z^. (eqn 3.19) 

The model now becomes an autoregressive integrated moving average process 
(ARl.M.A) of order (p,d,q.) and is written: 

{p(B)V‘^Zj = 0(B)A^. (eqn 3.20) 
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In this relationship the superscript d indicates the number of times that regular, 
backward differencing was utilized to achieve a stationary’ series. 

Seasonal differencing vvas used in several of the models to achieve stationarity. 
The inclusion of seasonal differencing produces a seasonal ARIMA model described as 
(p,d,q) X (P,D,Q,)j. The subscript s indicates the number of periods contained in one 
season. In the examination of quarterly data, there are four periods in one year and 
the subscript becomes a four. The general form of the seasonal ARIMA model is a 
multiplicative ARIMA model. The multiplicative nature of the model indicates that 
the model contains terms which are products of the regular and seasonal coefficients. 
Intuitively, this makes sense. In the case of quarterly data, the value in the series five 
periods previous to the present is included and has a coefficient which is the negative 
product of the first order term and the seasonal term [Ref 10: p. 164]. A multiplicative 
.A.RIMA model of order (1,1,0) x (1,1,0)^ is expressed as: 

Zt = ^Z^-i ■^‘^iZj. 4 -q)i<I’iZj .5 + Aj (eqn 3.21) 

with 

A^ ~X(0,cr^-) . 

<I>j = coefficient for the seasonal term. 

The general expression for the multiplicative ARI.VIA model is 

(Pp(B)(Dp(B3)V'^V3Dz^ = 0q(B)0Q(B5)A^ 

(eqn 3.22) 

where 

Oq = coefficient of the seasonal error term 

(Dp = coelTicient for the seasonal term. 

Several plausible models are developed during the estimation stage and these 
alternative models can be compared utilizing the diagnostics provided in most 
computer packages. The parameters and the associated residuals from each plausible 
model w'ere examined to validate the model. The residuals were examined for the 
presence of bias and autocorrelation. The three indicators used to detect the presence 
of these conditions were the residual mean and variance, the autocorrelations of the 
residuals, and the Q statistic. The model parameters were examined for statistical 
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significance and indications of high correlation between each other. Highly correlated 
parameters are usually an indication of the inclusion of unnecessary parameters in the 
model [Ref 10: p. 98]. The desired outcome is to determine one or more models which 
produces fitted values as close as possible to the original series. Additionally, it is 
desired that the models have as few parameters as possible [Ref 9: p. 17]. The final 
stage of the Box-Jenkins method, forecasting, enables the user to project the series into 
the future. Often. 95% confidence intervals for the projected values are provided by 
the computer package. The computer package utilized during this analysis was 
GR.AFSTAT. It is a package being developed by IBM and is installed at the Naval 
Postgraduate School for evaluation. 

In general, the results derived from the Box-Jenkins method are usually more 
accurate in the short and intermediate term than forecasts from other methods, 
including regression [Ref 11; p. 236]. The cost of these generally superior results are 
measured in the computer resources required to derive the model and the expertise 
required of the statistician to determine an appropriate model. The procedure allows 
for interpretation on the part of the forecaster. Two forecasters may identify different 
models as being the best model to fit the same data set. Even so, both sets of forecasts 
may be highly accurate when compared to the future observations of the series 
[Ref 3: p. 11]. 

The Box-Jenkins models developed for each of the contractors displayed strong 
predictive properties when used to predict the missing values. Table 3 presents the 
differencing required to achieve stationarity and the final form of the model for each of 
the contractors. The model for each contractor is expressed in standard Box-Jenkins 
notation with a seasonal period of four. The coefficient values, the model mean, and 
the standard errors of these terms are included in the presentation. 



C. BOX-JENKINS TRANSFER FUNCTION 

The Box-Jenkins transfer function is a procedure which allows a forecaster to 
aggregate the information contained in a particular series (output) with one or more 
related series (input) to forecast future values of the series. The relationship which is 
usually identified is that the trend present in the input series is reflected in the output 
series after a lag of several periods. Relationships of this order are referred to as 
dynamic responses. The aggregation of information is achieved through a transfer 
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TABLE 3 

SUMMARY OF BOX-JEXKIXS MODELS 



Summary' For Contractor A 



ARIMA 

AR(1.) 

Standard Error 
Mean 

Standard Error 



(1,0,0) x(0,I.0.)4 

.753 

.157 

2023.270 

6648.622 



Summary' For Contractor B 



ARIMA 

AR(1) 

Standard Error 
Mean 

Standard Error 



(1,1,0) X (0.1, 0,)4 
-.845 
.171 
488.515 
1162.532 



Summary For Contractor C 



ARIMA 

AR(1) 

Standard Error 
SAR(l) 

Standard Error 
Mean 

Standard Error 



(1,2.0) x(i, 0,0)4 
-.673 
.185 
.764 
.103 
305.319 
416.400 



Summary For Contractor D 



ARIMA 

AR(1) 



Standard Error 
.Mean 

Standard Error 



(1,0,0) x(0,0,0.)4 
.688 
.163 

189328.647 

6599.820 



TABLE 3 

SUMMARY OF BOX-JEXKIXS MODELS (COXT'D.) 



Summary’ For Contractor E 



A RIM A 
MA(1) 

Standard Error 
Mean 

Standard Error 



(0, 1.1) X (0,0, 0)4 
.906 
.059 
1532.200 
397.637 



I 



i 

i 



Summar\’ For Contractor F 



ARIMA 

AR(1) 

Standard Error 
SAR(l) 

Standard Error 
.Vlean 

Standard Error 



(1.1.0)x(l.l,0.)4 

-.607 

.166 

.875 

.129 

-350.716 

2534.731 



Summarv’ For Contractor G 



ARIMA 

AR(1) 



Standard Error 
.Mean 

Standard Error 



( 1.0,0) x(0, 1,0)4 
.854 
.103 
1125.453 
18744.529 



function model [Ref 9: p. 13]. This procedure is comprised of the same three phases as 
the univariate Box-Jenkins method. During the identification stage, differencing of the 
series is usually recommended in an effort to achieve a stationary series. However, a 
linear combination of elements in the output series often may be stationar>’, and 
differencing of all of the series in the model can cause complications in identifying the 
appropriate model [Ref 12]. 

The data trends may be tentatively identified and prewhitened in an effort to 
achieve an input series which strongly resembles white noise. This effort is made in an 
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attempt to improve the interpretability of the cross correlation function (CCF). If the 
input series is not prewhitened, the CCF often cannot be interpreted [Ref 13: p. 243]. 
A series which can be represented by an .ARI.MA model 

= {l-(piB-...-(PpBP)*l(l-ejB-...-eqB^)Aj. (eqn3.23) 



or as expressed with the backward shift operator 

=((),,■ 1{B)0^{B)A^. (eqn3.24) 

can be prewhitened by inverting the model. The prewhitened version of the series is 
expressed as follows: 



Qj = (p,^(B)e,^-l (B)Xj. (eqn3.25) 

The same transformation is applied to the output series culminating in the following 
expression: 

B^ = (p^(B)6,^-l (B)Zj. (eqn3.26) 

The relationship between the input and output series is described in an impulse 
response function of the form 

Zj = UqX^ + UjXj.j + u ')Xj. 2"^ •••■'■ Ht (eqn 3.27) 

where 

u is the impulse response 
is the random noise term. 

Expressing the impulse response function in terms of the prewhitened series, and B^, 
the function becomes 



B^ = u(B)aj + Cj. (eqn 3.28) 

where is the transformed noise series defined by 
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= <Px(B)e^'^ (B)nf. (eqn3.29) 

The impulse response weights can be determined through examination of the 
CCF. The following relationship is utilized in this regard: 



= k = 0,1,2.... {eqn 3.30) 

S|-j 

where 

P^B(^) “ correlation coeiTicient between the 

ttj. and Bj series for the element 
= estimated standard deviation of the series 

Sg = estimated standard deviation of the B^ series. 

Once estimates of are obtained, identification of those elements of the impulse 
function which are statistically significant enables the forecaster to identify an 
appropriate model to be used as a basis for the transfer function. From these 
estimates, simultaneous equations can be formed and solved to provide preliminary 
estimates of the parameters of the model. Since the impulse responses provided by the 
CCF are statistically inefficient in general, the proposed model is used as a starting 
point to be fitted by some more elaborate means. 

This procedure is extremely complicated and places a high demand on computer 
resources and the skills of the forecaster. The results should be indicative of the cost of 
attaining them. In this particular application of the procedure, several difficulties were 
encountered. A computer package to perform the entire procedure was not available. 
The GRAlFSTAT package mentioned previously does contain a CCF routine. 
Therefore the inefficient estimates of the impulse responses obtained from the 
examination of the CCF could be used to form simultaneous equations. However, this 
would be the most elaborate means of determining the parameters. The data from 
each contractor were examined in all combinations of undifTerenced. regularly 
differenced, and seasonally differenced forms. In all cases, the data were prewhitened 
before examination. Only one CCF indicated the presence of an impulse response 
value which was statistically different from zero. This was the prewhitened and 
undifTerenced series for Contractor D. Therefore, the only input value used in the 
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forecast of the output was the present input value. The nature of this model led to a 
single equation of the following form: 



Oq = 14.39 = cOq. 



(eqn 3.31) 



From equation 3.31, the final multivariate Bo.x-Jenkins model was determined to 



The model parameters are listed in Table 4. The forecasts made with this model are 
inferior to those obtained from the univariate Bo.x-Jenkins model. As will be discussed 
in the results chapter, the presence of the input series (X^) does not produce a 
regression model or a transfer function which outperforms the univariate Bo.x-Jenkins 
model for contractor D. The inability of the procedure to develop adequate models for 
the majority of the contractors may be attributable to the relatively small sample size 
of each data set. The longest series available for analysis was twenty-seven 
observations and the majority of the contractors supplied data in the range of twenty 
to twenty-four observations. Generally, sample sizes larger than sixty observations or 
data spanning at least eight complete seasonal periods are recommended as a minimum 
number of data points for the method to perform well [Ref 3: p. 6]. 



be; 



\\ = 14.39 Xj. 



(eqn 3.32) 



TABLE 4 

SUMMARY FOR CONTRACTOR D 




.49 

1034.734 

30387.40016 

14.39 



26 



IV. PREDICTION METHODOLOGY AND RESULTS 



Each model was used to predict the four withheld data points. For the GLS 
models actual X's for the four periods were available and used. In those cases where 
only one form of autocorrelation was present in the data, a transformation of the 
following type was made to the original data prior to estimation: 

Yt" ='pi^'t-i + (\-'Pi'»Vi) (eqnd.l) 

(t = T-3,T-2.T-i,T) 

(i= 1 or4) . 



The majority of the GLS model required two transformations to remove all forms of 
autocorrelation. In those cases, the X's and Y's were transformed in the following 
manner before the final GLS model was used to predict the withheld values. 

Transform the data for AR(4): 

Y,* = Y,.'p4Y,.4 

X,‘-X,.P 4 X ,.4 (eqn4.2) 

(t = 5,6,7,. ..,T) . 



Transform these new data to remove the effects of AR(1): 



jJ: /N * 

Yt 

❖ jjs y\ ''c- 

Xt =X,.-'?iX,.i 

(t=l,2,3,...,T). 



(eqn 4.3) 



Substitution of equation 4.2 into equation 4.3 allows both equations to be 
combined as follows: 



Y,"=Y,.$4Y,.4-p'|(Y,.,^P4Y,.5) 

Xj-"= X,-'P 4 X,. 4 ^ 1 (X^.,^ 4 X,. 5 ) (eqn 4.4) 

(t = 6,7,8 T) . 
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These equations can be simplified to the following form: 




(eqn 4.5) 



(t= 6.7,8,.. .,T). 

Assuming that all forms of autocorrelation have been removed, the following 
relationship holds: 



The entire transformation can be made in one step and forecasts can be made 
with the following equation. 



This set of equations pertained to the instance in which AR{4) was removed first and 
AR(l) was removed during the second application of GLS. These equations were 
developed by substituting the transformation for AR(4) into the equation for the 
removal of AR(1) processes. A similar expression was derived for those cases in which 
the correction for AR(1) preceded the removal of AR(4) processes. In the case of the 
Y's, this procedure required an iterative process to determine the last three values in 
the Y vector. 

The Bo.x-Jenkins models developed for each contractor were used to produce 
forecasts which were compared to the withheld data points. These forecast values were 
provided by the GR.AFSTAT package. The package also provided ninety-five percent 
prediction intervals for each forecast. A graphical presentation of the forecast is 
provided with an analysis of each contractor's data in the remainder of this chapter. 

Forecasts were made with the multivariate Box-Jenkins model. These were made 
by substituting the known values in the input (Xt) series into equation 3.32 and 
completing the computations. 




(eqn 4.6) 



i)j ~ N(0,ff") 

(t = 6,7,8,...,T) . 



Yt = PiY^.i + P4Yj.4-piP4Y^.5 + 

(X^-PiX^.fp4Xt.4-^1^4Xt-5)B 

(t = 6,7,8,..., T) . 



(eqn 4.7) 
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The forecasting capabilities of each model weje measured by three comparison 
indicators: correlation coefficient between the actual and predicted values, root mean 
square divided by the mean of the actuals, and the mean absolute percentage error. 
An analysis of the data and a presentation of the prediction results for each contractor 
follows. 

CONTRACTOR A 

Contractor A supplied twenty-two data points. The data was categorized and 
deflated to constant dollars as specified in Chapter II. Graphical presentations of the 
raw data is presented in the top two graphs of Figure 4. 1 . The upper left hand graph 
is a presentation of the overhead cost across time (twenty-two consecutive quarters). 
Likewise, the upper right hand graph is a display of the direct personnel trend across 
time. The overhead cost versus time graph displays a sharp decline in the first four 
quarters and a somewhat cyclic increase thereafter. The trend is increasing in general. 
The direct personnel versus time graph indicates a similar trend in general, but does 
not appear to be influenced by a seasonal trend to the extent that the overhead cost 
trend is. The lower left hand graph displays the relationship of overhead cost versus 
direct personnel. The weak relationship is depicted graphically and in the results of the 
OLS regression (adjusted = .183). The lower right hand graph in Figure 4.1 shows 
the relationship of overhead cost to direct personnel after both series have been 
transformed to remove the effects of autocorrelation. The adjusted for this model 
is .8787 as indicated in Table 3. The graphical portrayal of the transformed data 
suggests that the GLS model should be a dramatic improvement over the OLS model 
as a prediction tool. A summary of the predictive results of the GLS model is 
presented in Table 5. Actual X's were available and used in making predictions for the 
four withheld quarters. 

The Box-Jenkins model developed for Contractor A was presented in Chapter 3. 
A topic of concern was the amount of data available which could be used as a basis for 
the model. Generally, the amount of data needed to develop accurate models is fifty 
observations with one hundred preferred. Models can be developed in the absence of 
these am*ounts of data, but the forecaster must utilize experience and past information 
to develop preliminary models which can be updated as more iriformation becomes 
available [Ref 9; p. 18], A graphical portrayal of the results of the Box-Jenkins model 
is presented in Figure 4.2. The left hand graph is the actual overhead series including 
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Figure 4.1 Regression Analysis Graphs for Contractor A. 
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Figure 4.2 Box-Jenkins Graphs for Contractor A. 
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TABLE 5 

PREDICTION RESULTS FOR CONTRACTOR A 





REGRESSION 


BOX- 

JENKINS 


Correlation Coefficient: 


.862 


.881 


Root .Mean Squared Error 


.Mean of the Actuals: 


.0649 


.0376 


Mean .Absolute Percentage 
Error: 


5.98 


2.79 



the periods for which predictions were made. The right hand graph is the first twenty- 
two observations and the four prediction values. The trend of the predicted values can 
be compared to the actual trend by mentally superimposing one graph on the other. 
As indicated in Table 5, the results of the Bo,\-Jenkins model for Contractor A is 
superior to the results of the GLS model. 

Attempts to develop a multivariate Box-Jenkins model were made and proved to 
be unsuccessful. The cross autocorrelation function (CCF) was plotted. However, 
none of the impulse weights were statistically significant. All reasonable combinations 
of difierencing were examined in addition to a CCF in w'hich no differencing was 
included. The outcome was the same in all cases. 

B. CONTRACTOR B 

Figures 4.3 and 4.4 and Table 6 are provided for Contractor B. Figure 4.3 
displays the raw overhead cost and direct personnel series for Contractor B. As 
indicated in the upper right hand graph, the overhead series has an increasing trend 
accompanied by a seasonal variation. The direct personnel series is characterized as a 
consistently increasing series. A comparison of the slopes of the series in each graph 
indicates that the quarterly direct personnel count is increasing at a slightly greater rate 
than the overhead cost per period is. The overhead cost versus direct personnel chart 
reveals the presence of a relationship between those two series which is stronger than 
the near randomness revealed in the same graph for Contractor A. This is supported 
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Figure 4.3 Regression Analysis Graphs for Contractor B. 
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Figure 4.4 Box-Jenkins Graphs for Contractor B. 
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TABLE 6 

PREDICTION RESULTS FOR CONTRACTOR B 





REGRESSION 


BOX- 

JENKINS 


Correlation Coefficient: 


.584 


.675 


Root Mean Squared Error/ 


.Mean of the Actuals: 


.0952 


.0433 


Mean Absolute Percentage 
Error: 


8.58 


3.91 



by the results of the OLS model which are surprisingly good with an adjusted R of 
.669. After transformation, the GLS model achieves an adjusted R“ of .927. There are 
three transformed observations which are easily distinguished from the remainder of 
the transformed data. These observations are located closely together on the 
transformed data graph. The GLS model developed possesses all of the indications of 
a statistically significant and worthwhile prediction model. The adjusted R“, F- 
statistic, and T-statistic for the slope are all significant (Ref 14: p. 133]. Despite this 
fact, the Box-Jenkins model developed for Contractor B is superior as a forecasting 
tool in the range which is being examined in this paper. The multivariate Box-Jenkins 
model suffered from the same short comings as the model for Contractor A. The 
impulse weights were determined to be statistically insignificant. 

C. CONTRACTOR C 

Figures 4.5 and 4.6 and Table 7 are provided for Contractor C. The overhead 
cost and direct personnel graphs derived from Contractor C data are presented in 
Figure 4.5. The overhead cost trend appears to fluctuate significantly about a mean 
value of approximately 590,000,000 during the first seventeen quarters with the 
minimum value of the series occurring in the fifteenth quarter. The series shows a 
departure from this trend during the last eight quarters. The last four values depicted 
on the top two graphs are values which were not included during the model 
development stage. These are the values which are being withheld for prediction 
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Figure 4.5 Regression Analysis Graphs for Contractor C. 
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Figure 4.6 Box-Jenkins Graphs for Contractor C. 
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TABLE 7 

PREDICTION RESULTS FOR CONTRACTOR C 



BOX- 

REGRESSION JENKINS 



Correlation CoefFicient: 
Root Mean Squared Error 



.671 



.338 



.Mean of the Actuals: 
Mean .Absolute Percentage 



.112 



.117 



Error: 



8.52 



9.10 



comparison purposes. The direct personnel graph shows an increase during the first 
nine periods followed by a decrease for approximately eight periods. At that point, the 
number of direct workers employed at Contractor C increases for the remaining eight 
periods. This trend matches the overhead cost trend in general for the entire length of 
the data strings, but does not appear to be influenced by a seasonal component. The 
plot of overhead cost versus direct personnel appears nearly random (adjusted R^ = 
.291). The plot of the transformed data displays a strong direct relationship between 
the two variables. .As the number of personnel increases, the overhead cost increases. 
This is apparent in the GLS model which has an adjusted R of .840 and an F statistic 
of 106.326. The Box-Jenkins model graphs are presented in Figure 4.6 . These graphs 
are difficult to mentally superimpose on each other. This problem is caused by the 
bounds of the 95% confidence interval for the forecast observations. In the case of 
Contractor C, the regression model produces predictions which are actually closer to 
the actual values than those calculated by the Box-Jenkins model. .A comparison of 
the predictive results is presented in Table 7 . The multivariate Bo.x-Jenkins model was 
unusable for Contractor C. 

D. CONTRACTOR D 

The data supplied by Contractor D spanned twenty periods and is presented in 
Figures 4.7 and 4.8 and Table 8. The overhead cost versus time plot indicates that the 
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Figure 4.7 Regression Analysis Graphs for Contractor D. 
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Figure 4.8 Box-Jenkins Graphs for Contractor D. 
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TABLE 8 

PREDICTION RESULTS FOR CONTR.ACTOR D 





REGRESSION 


BOX- 

JENKINS 


TRANSFER 

FUNCTION 


Correlation Coefficient: 


-.757 


.641 


.418 


Root Mean Squared Error' 


Mean of the .Actuals: 


.197 


.0435 


.126 


-Vlean .Absolute Percentage 
Error: 


18.64 


3.54 


12.18 



series started at a relatively low point and increased rapidly in the fifth quarter. The 
trend remained relatively constant for six periods at which time it began to decrease 
slowly. This trend remained consistent through the twentieth quarter which is the last 
period included in the model development stage. The direct personnel series is similar 
to the overhead cost trend with an abrupt decrease in the thirteenth quarter. 
Beginning in the twenty-first quarter, the direct personnel trend becomes flat and 
neither increases nor decreases for the remainder of the series. These last four values 
are the known X's which are used in the regression model and the multivariate Box- 
Jenkins models. As the X's (direct personnel) become a level function in the prediction 
interval of the data, the Y's (overhead cost) suddenly increases. This departure from 
the previous trend causes problems with both of the models which utilize the available 
X values to generate predictions. The effect of this departure from the trend is 
indicated by the relatively poor results of these models as listed in Table 8. The 
multivariate Box-Jenkins model is predicated on a single impulse weight which was 
significant. This occurred at a lag of zero periods. The model which was developed 
utilized only the present period X value to predict the value of Y. Therefore, this 
model was susceptible to the trend departures present in the data. 

E. CONTRACTOR E 

Figures 4.9 and 4.10 and Table 9 are provided for Contractor E. The regression 
model developed for Contractor E is unique in the fact that it did not need to be 
corrected for autocorrelation. As indicated in the graphs in Figure 4.9, both the 
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Figure 4.9 Regression Analysis Graphs for Contractor E. 
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Figure 4.10 Box-Jenkins Graphs for Contractor E. 
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TABLE 9 

PREDICTION RESULTS FOR CONTR-‘\CTOR E 

1 

j 

! 


CONTR.ACTOR E 

REGRESSION 


BOX- 

JENKINS 


Correlation Coefficient: -.434 


SO 

o 

so 

1 


Root .Mean Squared Error' 

1 Mean of the Actuals; .107 


.137 


i Mean Absolute Percentage 

Error: 8.99 


13.83 



overhead cost and the direct personnel series are generally increasing with time. The 
overhead cost series tends to fluctuate above and below the general trend throughout 
the length of the series with the most significant departure from the general trend 
occurring in the twenty-first to the twenty-fifth data observations. The direct 
personnel series increased initially in the first ten periods and remained fairly constant 
at a level of approximately 6500 workers for the next ten periods. This series shows a 
sharp increase in the twenty-first period and continues to increase at a somewhat 
slower rate thereafter. The overhead cost versus direct personnel graph displays a 
direct relationship between these two variables in the absence of an autocorrelation 
correction. The strength of this relationship is less apparent as the number of 
personnel increases. The four observations in the upper right hand corner of this 
graph are the last four observations in each series in the model formulation range. 
Their proximity to each other is a function of the fact that the direct personnel series is 
slowly increasing in the twenty-first through the twenty-fourth observations. This 
explains the remoteness of their placement on the graph as they occur later in time 
than the large increase in the number of workers that was recorded in the twenty-first 
quarter. The appearance of these points as a nearly vertical line is a function of the 
small increase in the direct personnel component of the graph and the large 
lluctuations that occurred in the overhead cost series in the twenty-first to twenty- 
fourth quarters. The OLS model has an R“ of .527 and an F statistic of 26.59. 
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The ARIMA model for Contractor E was also .unique in the fact that it was the 
only contractor model which utilized a moving average (MA) process to describe the 
data trend. This model was difficult to identify and was actually determined through a 
process of elimination. Ever>’ reasonable model was analyzed and the ARIMA 
{0,1,1) X (0,0,0)_j was chosen on the basis of the statistical significance of the coelTicient 
and the model's performance as a forecasting tool. Several models which actually 
produced slightly better prediction results were excluded from consideration because 
the resulting coeflicients were not significant. The multivariate Box-Jenkins model 
could not be developed. The prediction results of the regression and Box-Jenkins 
models are presented in Table 9 . 



F. CONTRACTOR F 

Contractor F was the only contractor to supply data covering the entire 
observation period. Figures 4.11 and 4.12 and Table 10 are provided for Contractor F. 
The overhead cost and direct personnel series display similar trends. Both series 
increase until the ninth period, then decrease in general and eventually resume 
increasing. The overhead cost trend resumes increasing in the sixteenth quarter. This 
latter trend in the overhead cost series displays a seasonal fluctuation with the 
generally increasing trend. The direct personnel series decreases until the sixteenth 
period at which point it becomes a nearly constant function for six quarters. Beginning 
in the twenty-fourth quarter, the number of workers begins a rapidly increasing trend 
and continues in this manner through the remainder of the series. Several significant 
single increases and decreases occur within the direct personnel trend. Three of these 
rapid changes in the number of direct workers employed by Contractor F border above 
and below the 3600 to 3800 interval in the work force level. The absence of data 
observations in this range creates two distinct clusters of observations in the overhead 
cost versus direct personnel graph. This plot does not appear random because of the 
blank interval between the two groupings. However, the plot does not display a strong 
relationship between the components either. The OLS model has an R^ of .317. Both 
forms of autocorrelation, AR(1) and AR(4), were removed from the raw data through 
transformation and the resulting series displays a direct relationship as depicted in the 
lower right hand graph of Figure 4.11 The application of GLS to the raw data 
improves the model significantly. 
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Figure 4.11 Regression Analysis Graphs for Contractor F. 
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Figure 4.12 Box-Jenkins Graphs for Contractor F. 
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TABLE 10 

PREDICTION RESULTS FOR CONTR.-^CTOR F 





REGRESSION 


BOX- 

JENKINS 


Correlation Coefficient; 


.623 


.440 


Root .Vlean Squared Error/ 


Mean of the Actuals: 


.0886 


.0823 


.Vlean .Absolute Percentage 
Error: 


8.23 


6.86 



The Box-Jenkins model appears to approximate the trend well but underestimates 
the magnitude of the actual values. This model is approximately equal to the GLS 
model in prediction power. The attempted development of the multivariate Box- 
Jenkins model for Contractor F failed to indicate a significant impulse weight. 
Therefore, the development of the model was not pursued. 



G. CONTRACTOR G 

The data supplied by Contractor G spanned twenty-four quarters. The graphs in 
Figure 4.13 indicate that both the overhead cost and direct personnel series follow 
similar patterns during the twenty-four quarters. There is an increasing trend which 
becomes a decreasing trend in the vicinity of the seventh quarter for both series. The 
overhead cost series displays the influence of seasonal fluctuations about the general 
trend. The lower left hand plot of overhead cost versus direct personnel indicates that 
the uncorrected data possess a noticable direct relationship to each other. The R^ 
value for the OLS model was .541. After the data were corrected for both AR(1) and 
AR{4) processes resulting in a GLS model with an R^ of .862. The F statistic for this 
model was 133.155. Table 11 illustrates the outstanding predictive results of this 
model. The Box-Jenkins model for Contractor G is portrayed graphically in Figure 
4.14. The scaling of the two graphs is different because of the inclusion of the ninety- 
five percent confidence intervals in the forecast graph. As indicated in Table 11, the 
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Figure 4.13 Regression Analysis Graphs for Contractor G. 
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Figure 4.14 Box-Jenkins Graphs for Contractor G. 
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TABLE 11 

PREDICTION RESULTS FOR CONTRACTOR G 



1 

1 


REGRESSION 


BOX- 1 

JENKINS 1 


Correlation Coefficient: 


.638 


.604 1 


Root Mean Squared Error/ 


Mean of the Actuals: 


.0262 


.0394 


Mean Absolute Percentage 


Error: 

1 

1 

1 


1.96 


3.15 



model is an elTective forecasting tool for the specified prediction range. However, the 
regression model is superior in all three categories of comparison. The multivariate 
Box-Jenkins model could not be developed for Contractor G. The recurring problem 
of insignificant impulse weights plagued this data set as it has several others examined 
in this thesis. 
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V. CONCLUSIONS 



The intent of this project was to develop and compare forecasting models to be 
used in the prediction of overhead costs for seven government aerospace contractors. 
This is part of a continuing overhead tracking project at the Naval Air Systems 
Command. Three types of methodology were considered as possible model sources. 
These included least squares regression models, Box-Jenkins methods, and Box-Jenkins 
transfer functions. 

The regression models which were e.xamined were developed to be used by 
unsophisticated forecasters and operate well on a microcomputer. The characteristics 
of the data suggested the complications in model development which are associated 
with autocorrelation. Consequently, these models test for two forms of autoregressive 
processes; AR(1) and AR(4). Any necessary adjustments are made within the model. 
.A review of several literature sources indicated that the predictive capabilities of 
regression models are usually inferior to those of the Box-Jenkins method in the short 
run. In general this was found to be true. However, the regression models performed 
well in most cases and resulted in superior forecasting models for two of the 
contractors. 

The Bo.x-Jenkins class of models is considerably more complex than least squares 
regression. Therefore, a more e.xperienced forecaster and more efficient computer 
resources are required to employ this method. The majority of the Box-Jenkins 
transfer function models could not be developed for the data in this project. The 
difficulties encountered are probably due to the small sample size for each data set. 
Additionally, a computer package was not available which could perform the entire 
transfer function procedure. 

In those cases in which these requirements could be fulfilled, either of the Box- 
Jenkins methods should be utilized. In the absence of continuous access to such 
capabilities such as is the case in the Naval Air Systems Command overhead tracking 
project, least squares regression theory can be used to develop very adequate models 
which produce results which are usually far superior to the results obtained from the 
prevailing practice of applying estimated overhead rates to estimated direct labor 
hours. 
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APPENDIX 

APL FUNCTIONS 



The following APL functions perform the OLS and GLS regressions. The GLS 
function transform.s the data to remove AR(4) processes and GLSD makes the required 
transformation for AR(1). The PRED function computes the prediction effectiveness 
measurements for the OLS, Bo.x-Jenkins, and transfer function forecasts. The PREDl, 
PRED4. and PRED40 functions perform the same task for the regression models which 
required a transformation due to the presence of autocorrelation. The PREDl 
function was utilized in those cases where only AR(1) processes were detected during 
the regression model development stage. The PRED4 function performs a similar task 
for those cases in which AR(4) processes are the only form of autocorrelation. The 
PRED40 function makes the appropriate transformations for both AR(1) and AR(4) 
processes regardless of the order in which they were removed. 



- OLS\X2\Y \X1\XT ',X\XTXI iXTY \ K1\K\YHAT \YBAR\AY \ ERR\ET1^ 
Cl] R THIS FUNCTION COMPUTES THE REGRESSION STATISTICS OF AN 
C 2 ] fi INDEPENDENT VARIABLE AND A DEPENDENT VARIABLE . 

[ 3 ] fl THE FUNCTION SUPPLIES THE COLUMN OF ONES . 

C 4 ] ' ENTER THE INDEPENDENT VARIABLE ' 

[5] DL-^U 

C 6 ] ' ENTER THE DEPENDENT VARIABLE ' 

[7] OH^U 

[8] X2<-DL 

[ 9 ] Y^OH 
CIO] N<-pY 

cii] y-6-(iv,i)py 

C12] Xl-6-fVpl 

C13] Xr^(2,/7)p(Xl.X2) 

cm] x<-(s)xr 

CIS] XTxi^i(xr+.xx) 

C16] XT7-s-Xr+. xY 

C17] S^xrxi+. XXYY 
C18] Xl^pS 
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C19] 

[ 20 ] 

[ 21 ] 

[ 22 ] 

[23] 

[24] 

[25] 

[26] 

[27] 

[28] 

[29] 

[30] 

[31] 

[32] 

[33] 

[34] 

[35] 

[36] 

[37] 

[38] 

[39] 

[40] 

[41] 

[42] 

[43] 

[44] 

[45] 

[46] 

[47] 

[48] 

[49] 

[50] 

[51] 

[52] 

[53] 



;:<-ii:i[i] 

E-<rY-YHAT 

i?55-6-lp((IS|E)+.xE) 

S2*-RSSr(N-K) 

S*-S2-x0.5 

YBAR^( + / (.(.N)QY))iN 
AY^( iN)pY)-YBAR 
TSS*-( (N^pY^+.xAY 
ESS-<rTSS-RSS 
R2^ESSiTSS 

ADJR2^1-( (RSSi- (N-K) )*(.TSS* (N-1))) 

COVB^S2xXTXI 
STERRA*-COVB [ 1 ; 1 ] * 0 . 5 
STERRB-^COVBL2i21*0 .5 
TA*-BL1 111* (C0VBZU11*0 .5) 

TB*-B L2 1 11 *(COVB 12; 21*0 .5) 
F-<r(,R2*(.K-l))*Ul-R2)*(N-K')) 

ERR^iNlpE 

ET1^0,((N-1)*ERR) 

EET1*-(1*(ERR-ET1))*2 
ZJl-s-lp ((+/EET1 )*RSS) 

ET^*- 0 0 0 0 , ( (N-n ) *ERR ) 

EET^*-(U* CERR-ET ^- ) )*2 
DU*-lp ((+/EETt^)*RSS) 

P41-«-1-(0.5xD4) 

P42«e-(((iV*2)x(l-(0.5xD4)))+(iJ*2))*((/V*2)-(X*2)) 

P10*-(+/(l*(ERRxETl)))*(+/(l*~l*(ERRxERR))) 

' RSS ' , 5PPSS 

' ESS ' , ^ESS 

' TSS ' , ^TSS 

'S 

t t 

•ADJR2 ' ,^ADJR2 

t t 
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[54] 


'A ' 


, (®5[l ; ] ) 


[55] 


' STERRA 


' , ^STERRA 


[56] 


'B ' 


, (iJsS [2 ; ] ) 


[57] 


' STERRB 


' , ^STERRB 


[58] 


1 1 




[59] 


•P ' 


,^F 


[60] 


xTA ' 


' ,^TA 


[61] 


'TB ' 


' ,^TB 


[62] 


t t 




1 — 1 
(T> 
CO 
1 1 


•PI ' 


' ,iisPl 


[64] 


'PIP 


' ,^P10 


[65] 


1 1 




[66] 


'P4 1 


' ,®P4 


[67] 


ip41 


' , ijP41 


[68] 


ip42 


' ,i5P42 



V 



- GLS',X2\l\N',Yl%Y2%YlG\Y2G\X21\X22\X21G\X22GxXl',XT',X 
- 1 - fl THIS FUNCTION TRANSFORMS THE DATA TO REMOVE THE EFFECTS 
-2- THE VECTORS OF XG AND YG ARE THE 

-3 - (^TRANSFORMED VECTORS AND ARE GLOBAL VARIABLES . 

[ 4 ] ' ENTER THE INDEPENDENT VARIABLE ' 

C5] A^Q 

C 6 ] ' ENTER THE DEPENDENT VARIABLE ' 

C7] Z^U 

C 8 ] ' ENTER THE ESTIMATE OF P4 ' 

[9] P4-«-0 

[ 10 ] X2^A 

[ 11 ] Y^Z 

[ 12 ] N^pY 

[13] yl^4'^y 

[14] I2-s-(/7-4 )-M 

[15] YlC^yix((l-(P4*2))*0.5) 

[16] Y2G^in^Y)-(PnxY2) 
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[17] YG^Y1G,Y2G 

[18] Y-^iN,l)pYG 

[19] X21>e4+X2 

[ 20 ] X22-^(N-n)^X2 

[21] X21C>e-X21x((l-(P4*2))*0.5) 

[22] X22C'e-(4+X2)-(P4xX22) 

[23] XC-«-(X21C,X22C) 

[24] Xl>6-iVpl 

[25] XT^(.2,N)piXl,XG) 

[26] X^^XT 

[27] X2’XJ'6-g(X2’+.xX) 

[28] xry-6-xr+.xy 

[29] sc-6-xrxj+.xxrj 

[30] Kl-^pBG 

[31] X-6-Xl[l] 

[32] YHATG^X+.^BG 

[33] EG^Y-YHATG 

[34] RSSG^lpU>^EG) + .^EG) 

[35] S2G-<rRSSG*iN-K) 

[36] SG^S2G*0.5 

[37] YBARG^(+/ UN)pYG))*N 

[38] AYG^((N')pYG)-YBARG 

[39] TSSG^iiN)pYG) + .>^AYG 

[40] ESSG-^TSSG-RSSG 

[41] R2G^ESSG*TSSG 

[42] ADJR2G<rl-(, iRSSG* (N-K))* (TSSG* (N-l ) ) ) 

[43] COVBG-<-S2G^XTXI 

[44] STERRAG-^COVBGLl; 11*0 .5 

[45] STERRBG^COVBGL2; 21*0 .5 

[46] 2’;iC'6-fl(;[l;l] + (C07B(;[l;l]*0.5) 

[47] TBG-^BGL2ill*iCOVBGL2;21*0 .5) 

[48] FG^(R2Gi(.K-l))*((l-R2G)*(N-K)) 

[^9] ERRG^(N)pEG 

[50] ETlG^rO ,i(.N-l)'i-ERRG) 

[51] EETlG-^il^'iERRG-ETlG))*2 
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L521 DlG^lp(i+/EETlG)iRSSG) 

[53] 0 0 0 0 ,( (A/- 4 

[54] EET^G<-(n'\^(ERRG-ET>4G))*2 

[55] D^G-^lp((+/EET^G)^RSSG) 

[56] P41C^l-(0.5xC4(7) 

[57] P>^2G^((. (A7*2 ) x (1 - ( 0 . 5 xP4C ) ) ) + (K*2))i( (A/*2 )-{K*2 ) ) 

[58] P1C^(+/(14> (PPPCxPriC) ) (+/(14-"14- (PPPCxPPPC) ) ) 



[55] 


' RSSG ' 


, ^RSSG 


[60] 


' ESSG ' 


, ^ESSG 


[61] 


' TSSG ’ 


, ^TSSG 


[62] 


'SG ' , 


^SG 


[63] 


t 1 




[64] 


'ADJR2G 


' ,^ADJR2G 


[65] 


t 1 




[66] 


'AG ' , 


(5PC[1 ; ] ) 


[67] 


'STERRAG 


' ,^STERRAG 


[68] 


'EG ' , 


(5iSC[2;] ) 


[69] 


’ STERRBG 


' , ^STERREG 


[70] 


1 t 




[71] 


'FG ’ , 


^FG 


[72] 


'TAG ' 


,^TAG 


[73] 


'TEG ' 


, mTEG 


[74] 


1 t 




[75] 


'DIG ' 


, ^DIG 


[76] 


'PIG ' 


,^P1G 


[77] 


1 t 




[78] 


'P4C » 


,^DnG 


[75] 


'PHXG • 


, ®P41{7 


[80] 


'P42C * 


,5P42G 



V 



V GLSD \X2\Y \N \ll\12iX21',X22',Xl\XT \X\XTXI xXTY \KlxK 
[ 1 ] oTPIS FUNCTION CORRECTS TEE REGRESSION MODEL FOR THE 
[ 2 ] uTHE EFFECTS OF AR{1) , THE TRANSFORMED VECTORS XF AND 
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C 3 ] R YF ARE GLOBAL VARIABLES . 

[ 4 ] • ENTER THE INDEPENDENT VARIABLE ' 

C5] c-6-n 

C 6 ] ’ ENTER THE DEPENDENT VARIABLE ' 

C7] £)«6-D 

C8 ] ' ENTER THE VALUE OF PI ' 

C9] Pl«e-D 
CIO] X2-S-C 

[11] Yf-D 

[12] iV*e-pY 

[13] Y1-6-0,1 + Y 

[14] Y2-s-(/7-l )-hY 

[15] YlF-e-Ylx((l-(Pl*2))*0.5) 

[16] Y2P^(l+Y)-(PlxY2) 

[17] YP-e-l'KYlP,Y2P) 

[18] Y-s-(A^,l )pYF 

[19] X21«f-0,1+X2 

[20] X22-s-(W-l ) + X2 

[21] X21F-e-X21x((l-(Pl*2))*0.5) 

[22] X22F-s-(l+X2)-(PlxX22) 

[23] XF«s-l4'(X21F,X2 2F) 

[24] XI^NqI 

[25] XY-f-(2,/7)p(Xl,XF) 

[26] X^^XT 

[27] XYXJ-f-iCXF+.xX) 

[28] XPY-«-XP+.xY 

[29] BF^XTXI+.^XTY 

[30] Xl-e-pBF 

[31] X-e-XlCl] 

[3 2] YHATF<rX+ ,^BF 

[33] EF^Y-IHATF 

[34] P5SF-s-lp((<S>FF) + .xPF) 

[35] S2F^RSSFi(N-K) 

[36] SF-e-S2F*0.5 

[37] YBARF^i+/ UN)pYF))iN 
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[38] AYF^i(N)pIF)-YBARF 

[39] r55F^((/V)pIF) + .x;iyF 

[40] ESSF-^TSSF-RSSF 

[ 41 ] RIF-^tESSFtTSSF 

[4 2] ADJR2F^l-( (RSSFi- (N-K) )r (TSSFr (iV-1 ) ) ) 

[43] C07FF-S-52FXXFXJ 

[ 44 ] STERRAF^COVBFLl', 11 *0.5 

[ 45 ] STERRBF-^COVBF l2\ 21*0 .5 

[46] r.4F-s-5F[l ; 1] * (COVBFLl; 1] *0 . 5 ) 

[47] TBF-^BFL2illi (COVBFL2 i 21 *0 . 5 ) 

[48] FF^(F2Ft (X-1 ) )t( (l-F2F)r(iV-X) ) 

[49] ERRF-^iNlpEF 

[50] ETIF^O ,iiN-l)^ERRF) 

[51] EET1F*-(1^(ERRF-ET1F'))*2 

[52] DlF^lQ((+/EETlFlrRSSF) 

[53] F2’4F-6- 0 0 0 0 , ( (iV-4 )-^EFFF) 

[54] FF2’4F-5-(4 4'(FFFF-F2’4F) )*2 
[5 5] F4F-6-lp(( + /FF2’4F)^F55F) 

[56] P41F^l-(0.5x£)4F) 

[5 7] F4 2F*6-( ( {N*2 )x(l - (0 . 5 x£54F) ) )+(X*2 ) )+ ( (/\^*2 ) - (X*2 ) ) 

[58] FlF-e-(+/(l4(FFFFxEriF) ) )t (+/ (l + "l + (FPPFxFPPF) ) ) 

[59] 'RSSF ' ,^RSSF 

[60] 'FSSF ' ,^ESSF 

[61] 'TSSF ' ,^TSSF 

[62] »SF ',!5SF 

[63] ' » 

[6U] 'ADJR2F ' ,^ADJR2F 

[65] ‘ » 

[66] MF ',(<5flF[l;]) 

[67] 'STERRAF ' ,ts^STERRAF 

[68] 'FF »,(3!flF[2;]) 

[69] ' STERRBF ' , ^STERRBF 

[70] ' ' 

[71] 'FF ',$FF 

[72] 'TAF ' ,^TAF 
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' , ^TBF 



[73] 'TBF 

[74] ' 

[75] 'DIF ' ,JSD1F 

[76] 'PIF ' ,^P1F 

[77] ' ' 

[78] 'DnF ' ,^DUF 

[79] 'P41f ',®P41F 

[80] 'P42F ' ,$P42F 

V 



7 BRED ; YPA ; YAA ; YPM ; YAM ; NUM ; DENOM 
[ 1 ] flTPJS FUNCTION PROVIDES TEE MEASURES OF PREDICTIVE 
[ 2 ] fl EFFECTIVENESS FOR TEE BOX- JENKINS MODELS . 

[ 3 ] ' ENTER TEE A CTUAL SERIES VECTOR ' 

[4] OEl^U 

[ 5 ] ' ENTER TEE BOX- JENKINS FORECAST VALUES ' 

[6] yp-s-Q 

[7] y^^"4+0ff2 

[ 8 ] N^pYA 

[9] MilPF^C ( + /( ( I (YP-YA))iYA))iN)^100 

[10] RMSE<rU ( + /((YP-YA)*2 ))*N)*0.5 )*U+/YA)iN) 

[11] YPA^i+/YP)*N 

[12] YAA^(+/YA)iN 

[13] YPM^YP-YPA 

[14] YAM^YA-YAA 

[15] NUM^(+/(.YPMxYAM))iN 

[16] DENOM^(U+/ CYPM*2) )*N)^ (.(.+/ CYAM*2))*N'))*0, 5 

[17] CORR^NUM* DENOM 

[18] ' PREDICTED Y' 

[19] yp 

[ 20 ] ' ' 

[21] 'ACTUARY' 

[22] YA 

[23] ' ' 
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[24] 'CORRELATION ' ,^CORR 

[25] ' 

[26] 'RMSE/MA ' ,^RMSE 

[27] ' ' 

[28] 'MAPE ' ,^MAPE 

V 



V PRED 1 ; yPi4 iYAAi YPH ; YAM ; NUM ; DENOM 
[ 1 ] f^TRIS FUNCTION COMPUTES THE MEASURES OF PREDICTIVE 
[ 2 ] fl EFFECTIVENESS FOR THOSE CASES IN WHICH CORRECTIONS 
[ 3 ] R FOR FIRST ORDER A UTOCORRELATION ARE MADE , ' 

[ 4 ] ' ENTER THE VECTOR CONTAINING THE ACTUAL Y VALUES . ' 

[5] OHl^U 

[ 6 ] ' ENTER THE VECTOR CONTAINING THE ACTUAL X VALUES . ' 

[7] DL2^U 

[ 8 ] ' ENTER THE VALUE OF PI . ' 

[9] p^n 

[10] YA^~n^-OH2 

[11] X-e-~4-M)L2 

[12] X/V^4+ ("5+1)12 ) 

[13] YN<-5p0 

[14] yP-6-4pO 

[15] YiV[l]-6-l + ("5+(7P2 ) 

[16] L^O 

[17] ITERATE :L^L+1 

[18] YPLLl<-YNLL+ll<r(PxYNLLl ) + ( (X[L] - (PxX/7[i] ))xB) 

[19] -^(1^3) /ITERATE 

[ 20 ] YN^U'^^YN 

[ 21 ] N^pX 

[22] YP-6-(PxyiV) + ((X-(PxX/7))x5) 

[2 3] MAPE^U+/ U \ (YP-YA))*YA))iN)ylOO 

[24] RMSE^(U+/ UYP-YA )*2) )*N)*0 . 5 U+ZYA^^N) 

[25] YPA^i+ZYP^^N 

[26] YAA^i+ZYA^^N 
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[273 YPM->rYP-YPA 
[283 YAM^YA-YAA 
[293 NUM-^(+/ (YPMxYAM)nN 

[3 0 3 DENOM^(((+/(YPM*2))*N)x((-^./(YAM*2))iN))i<Q.5 
[313 CORR^NUMiDENOM 
[323 ' PREDICTED Y ’ 

[333 YP 
[343 ’ ' 

[3 53 'ACTUARY' 

[363 YA 
[373 ' ' 

[383 ' CORRELA TION ' CORR 
[393 ' ' 

[403 'RMSE/MA ' ,^RMSE 
[413 ' ' 

[423 'MAPE ' ,^MAPE 

V 



V PREDU ; YPA ; YAA ; YPM ; YAM ; NUM ; DENOM 
[ 1 3 (^THIS FUNCTION COMPUTES THE MEASURES OF PREDICTIVE 
[23 fl EFFECTIVENESS FOR THOSE CA SES WHERE CORRECTIONS FOR 
[ 3 3 P^FOURTH ORDER AUTOCORRELATION ARE MADE . 

[43 ’ ENTER A VECTOR CONTAINING THE ACTUAL Y VALUES . ' 

[53 OH2^a 

[63 ' ENTER THE VECTOR OF X VALUES ' 

[73 DL2^U 

[83 ’ ENTER THE VALUE OF P4 ' 

[93 P-^Q 
[103 YA^~n‘f-OH2 
[113 YN^'n'^OHl 
[123 X-^~n^DL2 
[133 XN-<r~n^DLl 
[143 N^pX 

[153 YP^(P^YN)+((X-(P^XN))xB) 
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[16] M^PF-s-((+/(( 1 (yP-Y^))-^Yin)*iV)xlQO 

[17] ((( + /( (YP-Y^ )*2 )) t/7)*0. 5 Jt((+/Y^)+/V) 

[18] YPA^i-^/YP)iN 

[19] YAA-^{^/YA)iN 

[ 20 ] YPM‘<rYP-YPA 

[21] YAM^YA-YAA 

[ 22 ] NUM^i^/ VIPM^YAM))iN 

[23] PEiVOM^((( + /(YPW*2))-t/7)x(( + /(Y^M*2))+/\/))*0.5 

[24] CORR^NUMiDENOM 

[25] ' PREDICTED Y' 

[26] YP 

[27] ' ' 

[28] 'ACTUARY' 

[29] YA 

[30] ' ' 

[31] 'CORRELATION ' ,^CORR 

[32] ' ' 

[33] 'RMSE/MA ' ,^RMSE 

[34] ' ' 

[35] 'MAPE ' ,mAPE 

V 



V PREDnO ; YPA ; YAA ; YPM ; YAM ; NUM ; DENOM 
[ 1 ] P THIS FUNCTION COMPUTES THE MEASURES OP PREDICTIVE 
[ 2 ] p EFFECTIVENESS FOR THOSE MODELS CORRECTING FOR 
[3 ] pPIPSr ORDER AND FOURTH ORDER AUTOCORRELATION : 

[ 4 ] ' ENTER THE VECTOR CONTAINING ACTUAL Y VALUES ' 

[ 5 ] OH2->rU 

[ 6 ] ' ENTER THE VECTOR CONTAINING X VALUES ' 

[ 7 ] DL2^a 

[8] 'ENTER A VALUE FOR Pn' 

[9] P4^D 

[10] ' ENTER THE VALUE FOR PI ' 

[11] Pl^D 
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C 1 2 ] ' ENTER TEE VECTOR CONTAINING THE PREDICTED VALDES ' 

[13] 

[14] 

[15] X^"4+£ii2 

[16] Xn^4^- C5 + PL2 ) 

[17] Xr4<s-4f (~8fPL2 ) 

[18] Xr5^4+ ( "9+15^2 ) 

[19] XS«s-Sx ( ( (X- (P4xX2’4 ) )- (PlxXTl ) ) + (Pi xP4xXT5 ) ) 

[20] yp^4po 

[21] ypi^spo 

[22] y2’l[l]-e-l+C5f<9P2) 

[23] 7r4-s-4+ (■8t(0P2 ) 

[24] ir5-6-4+ C9+OP2 ) 

[25] L<-0 

[26] ITERATE :L^L + 1 

[27] yP[L]«s-Jri[L + l]-*-(((Plxyri[L] ) + (P4xyp4[L] ))-(PlxP4xyr5[L] ))+Xi 

[28] ^ (L^ 3 ) /ITERATE 

[29] N<^pX 

[30] MAPP-*- ((+/(( I (yp-yA))*y;i))+^)xioo 

[31] RMSE^( ( ( +/ ( (yP-yA )*2 ) )+A?)* 0 . 5 )* (.(+/YA)tN) 

[32] ypA^(+/yp)^iv 

[33] YAA^(+/YA)^N 

[34] YPM^YP-YPA 
[3 5] YAM-^YA-YAA 

[36] NUM^i+/ (YPM^YAM))*N 

[37] DENOM-friU + /(YPM*2))*N)x(( + /(YAM*2))*N))*0 .5 

[38] CORR^NUMiDENOM 

[39] 'PREDICTED Y' 

[40] yp 

[41] ' ' 

[42] ' ACTUAL Y' 

[43] yA 

[44] ' » 

[45] ' CORRELA TION ' ,^CORR 

[46] ' • 
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[47] 'RMSE/MA 

[48] ' ' 

[49] 'MAPE 

V 



' , ^RMSE 
' , ^MAPE 
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